Amendments to the Claims:

This listing of claims will replace all prior versions and listings of claims in this 
application: 

Listing of Claims: 

1. (Previously presented) A processor-implemented method of
clustering a set of records, each of the records having attribute values for a 
set of attributes, the method comprising: 

for each attribute of the set of attributes, determining a characteristic 
value for said each attribute, based on attribute values of said each 
attribute; 

wherein determining the characteristic value comprises calculating 
the attribute values of said attribute across the records; 

for each attribute value, determining a deviation from the 
characteristic value of said each attribute; 

for each record, sorting the set of attributes based on deviations of 
the attribute values, to provide a key; and 

clustering the set of records based on the key into a clustering result 
that includes a plurality of clusters; 

wherein the key comprises an ordered list of the set of attributes and 
the deviations; and 

refining the clustering result by selectively changing a length of the
key to change the number of the clusters.
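
For illustration only, the steps recited in claim 1 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the applicant's implementation: the mean is used as the characteristic value (as in claim 2), attributes are ordered by descending absolute deviation (as in claim 6), and two keys are treated as matching when their leading `key_length` entries name the same attributes in the same order with the same sign of deviation. The function names `build_key` and `cluster_by_key` and the sample records are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def build_key(record, characteristic):
    """Return (attribute, deviation) pairs ordered by descending absolute deviation."""
    deviations = {a: record[a] - characteristic[a] for a in characteristic}
    ordered = sorted(deviations, key=lambda a: abs(deviations[a]), reverse=True)
    return [(a, deviations[a]) for a in ordered]


def cluster_by_key(records, key_length):
    """Group record indices by the first `key_length` entries of each record's key."""
    attributes = list(records[0])
    # Assumption: the characteristic value is the mean across all records.
    characteristic = {a: mean(r[a] for r in records) for a in attributes}
    clusters = defaultdict(list)
    for index, record in enumerate(records):
        key = build_key(record, characteristic)
        # Assumption: keys match on attribute name and sign of the deviation.
        prefix = tuple((a, d >= 0) for a, d in key[:key_length])
        clusters[prefix].append(index)
    return dict(clusters)


# Hypothetical sample data.
records = [
    {"age": 30, "income": 50, "visits": 2},
    {"age": 32, "income": 90, "visits": 3},
    {"age": 60, "income": 52, "visits": 2},
    {"age": 58, "income": 48, "visits": 9},
]
coarse = cluster_by_key(records, key_length=1)
fine = cluster_by_key(records, key_length=3)
print(len(coarse), len(fine))
```

With this sample data, lengthening the key from one entry to three splits the records into more clusters, which is one way to read the refining step of claim 1.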

2. (Original) The method of claim 1, further comprising calculating a
mean value of the attribute values of said each attribute as the 
characteristic value. 

3. (Original) The method of claim 1, wherein a median value of the 
attribute values of said each attribute is determined as the characteristic 
value. 

4. (Original) The method of claim 1, wherein determining the deviation
comprises calculating a difference between said each attribute value and 
the characteristic value of said each attribute. 

5. (Original) The method of claim 1, wherein determining the deviation
comprises calculating a difference between said each attribute value and 
the characteristic value of the corresponding attribute, and dividing the 
difference by the characteristic value of said each attribute. 
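
For illustration only, the two deviation measures of claims 4 and 5 can be sketched in Python as follows, with either a mean (claim 2) or a median (claim 3) supplied as the characteristic value. The function name `deviations`, the `relative` flag, and the sample values are hypothetical. The guard against a zero characteristic value is an assumption added here; the claims do not address that case.

```python
from statistics import mean, median


def deviations(values, characteristic, relative=False):
    """Return the deviation of each value from the characteristic value.

    relative=False: plain difference (claim 4).
    relative=True: difference divided by the characteristic value (claim 5).
    """
    if relative and characteristic == 0:
        # Assumption: a zero characteristic value is rejected rather than handled.
        raise ValueError("relative deviation is undefined for a zero characteristic value")
    result = []
    for value in values:
        difference = value - characteristic
        result.append(difference / characteristic if relative else difference)
    return result


incomes = [50, 90, 52, 48]  # hypothetical attribute values
print(deviations(incomes, mean(incomes)))
print(deviations(incomes, mean(incomes), relative=True))
print(deviations(incomes, median(incomes)))
```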

6. (Original) The method of claim 1, wherein sorting the set of attributes 
comprises using absolute values of the deviations of the attribute values as a 
sorting criterion. 

7. (Original) The method of claim 1, wherein a first record of the set of
records contains a first key and a second record of the set of records 
contains a second key; and 



Application Serial No.: 10/700,908
Filing Date: November 3, 2003
Reply to Office action of: October 17, 2006
Attorney Docket No.: DE920010103US1

further comprising placing the first key and the second key into a 
single cluster if the first key and the second key have identical sub- 
sequences of a first length. 

8. (Original) The method of claim 1, wherein a first record of the set of
records contains a first key and a second record of the set of records 
contains a second key; and 

further comprising placing the first key and the second key into a 
single cluster if the first key and the second key have identical sub- 
sequences of absolute values of the deviations. 

9. (Original) The method of claim 1, wherein a first record of the set of
records contains a first key that has a first sub-sequence, and a second
record contains a second key that has a second sub-sequence; and

further comprising placing the first key and the second key into a 
single cluster if the first and second sub-sequences comprise the same set of 
attributes. 

10. (Original) The method of claim 9, wherein the first and second sub- 
sequences comprise the same set of attributes irrespective of a sign of the 
deviations of the attribute values. 
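
For illustration only, two of the key-matching tests recited in claims 7 through 10 can be sketched in Python as follows. Keys are represented as ordered lists of (attribute, deviation) pairs. The function names and sample keys are hypothetical. Reading "identical sub-sequences" in claim 7 as agreement in attribute order and in sign of deviation is an assumption made for this sketch; the claims do not define the comparison.

```python
def same_prefix(key_a, key_b, length):
    """Claim 7 style test: the leading `length` entries agree in order and sign."""
    def signature(key):
        return [(attribute, deviation >= 0) for attribute, deviation in key[:length]]
    return signature(key_a) == signature(key_b)


def same_attribute_set(key_a, key_b, length):
    """Claims 9-10 style test: the same leading attributes, ignoring order and sign."""
    def names(key):
        return {attribute for attribute, _ in key[:length]}
    return names(key_a) == names(key_b)


# Hypothetical keys for two records.
key_a = [("age", 15), ("income", -8), ("visits", -2)]
key_b = [("income", 30), ("age", -13), ("visits", -1)]
print(same_prefix(key_a, key_b, 2))
print(same_attribute_set(key_a, key_b, 2))
```

The second test is the looser of the two: records whose most deviating attributes coincide fall into one cluster even when the order or direction of the deviations differs.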



11. (Currently amended) The method of claim 10, [[further comprising]]
wherein refining [[a]] the clustering result [[by]] comprises:

identifying a cluster having a smallest number of records; 

for each record of the identified cluster searching another cluster 
having records with best matching keys; and 

distributing the cluster with the smallest number of records to the other 
cluster having records with best matching keys, to reduce the total number 
of clusters. 

12. (Original) The method of claim 11, further comprising reducing a
length of the first sub-sequence and a length of the second sub-sequence in 
order to find a best match. 

13. (Original) The method of claim 12, further comprising using a 
distance measure to find another cluster for a record of the identified 
cluster. 

14. (Previously presented) The method of claim 13, wherein the
distance measure comprises a Euclidean distance. 
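
For illustration only, the refinement of claims 11 through 14 can be sketched in Python as follows: the cluster with the fewest records is identified, and each of its records is moved to another cluster chosen by Euclidean distance. Comparing each record against the centroid of every remaining cluster is an assumption made for this sketch; the claims do not state which points the distance measure is applied to, and the key-shortening search of claim 12 is not shown. The function name `merge_smallest_cluster` and the sample data are hypothetical.

```python
import math


def merge_smallest_cluster(clusters, vectors):
    """Distribute the records of the smallest cluster among the other clusters.

    clusters: dict mapping a cluster label to a non-empty list of record indices.
    vectors: list of numeric tuples, one per record (for example, deviations).
    """
    if len(clusters) < 2:
        return {label: list(members) for label, members in clusters.items()}
    smallest = min(clusters, key=lambda label: len(clusters[label]))
    remaining = {
        label: list(members)
        for label, members in clusters.items()
        if label != smallest
    }
    # Assumption: each remaining cluster is represented by its centroid,
    # computed once before any record is moved.
    centroids = {
        label: tuple(
            sum(column) / len(members)
            for column in zip(*(vectors[i] for i in members))
        )
        for label, members in remaining.items()
    }
    for index in clusters[smallest]:
        nearest = min(
            centroids,
            key=lambda label: math.dist(vectors[index], centroids[label]),
        )
        remaining[nearest].append(index)
    return remaining


# Hypothetical data: five records in three clusters.
vectors = [(0, 0), (1, 0), (10, 10), (11, 10), (9, 9)]
clusters = {"a": [0, 1], "b": [2, 3], "c": [4]}
print(merge_smallest_cluster(clusters, vectors))
```

Each call removes one cluster, so repeated calls reduce the total number of clusters step by step.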

15. (Currently amended) A computer program product having
executable instruction codes that are stored on a [[computer-usable]]
computer-readable medium, for clustering a set of records, each of the 
records having attribute values for a set of attributes, the computer program 
product comprising: 

a [[first]] set of instruction codes, which, for each attribute of the set of
attributes, determines a characteristic value for said each attribute, based 
on attribute values of said each attribute; 

wherein the characteristic value is calculated from the attribute 
values of said attribute across the records; 

a [[second]] set of instruction codes, which, for each attribute value,
determines a deviation from the characteristic value of said each attribute; 

a [[third]] set of instruction codes, which, for each record, sorts the set of
attributes based on deviations of the attribute values, to provide a key; and 

a fourth set of instruction codes for clustering the set of records based 
on the key into a clustering result that includes a plurality of clusters; 

wherein the key comprises an ordered list of the set of attributes and 
the deviations; and 

a set of instruction codes for refining the clustering result by selectively 
changing a length of the key to change the number of the clusters.

16. (Currently amended) The computer program product of claim 15,
further comprising a fifth set of instruction codes for calculating a mean
value of the attribute values of said each attribute as the characteristic 
value. 

17. (Currently amended) The computer program product of claim 15,
further comprising a sixth set of instruction codes for setting a median value 
of the attribute values of said each attribute as the characteristic value. 

18. (Currently amended) The computer program product of claim 15,
wherein the [[second]] set of instruction codes for determining the deviation
from the characteristic value determines the deviation by calculating a 
difference between said each attribute value and the characteristic value 
of said each attribute. 

19. (Currently amended) The computer program product of claim 15, 
wherein the [[second]] set of instruction codes for determining the deviation
from the characteristic value determines the deviation by calculating a 
difference between said each attribute value and the characteristic value 
of the corresponding attribute, and by dividing the difference by the 
characteristic value of said each attribute. 

20. (Currently amended) The computer program product of claim 15, 
wherein the [[third]] set [[on]] of instruction codes for sorting the set of attributes
sorts the set of attributes using absolute values of the deviations of the
attribute values as a sorting criterion. 

21. (Currently amended) A processor-implemented system for
clustering a set of records, each of the records having attribute values for a 
set of attributes, the system comprising: 

each attribute of the set of attributes comprising a characteristic 
value for said each attribute based on attribute values of said each 
attribute; 

wherein the characteristic value is calculated from the attribute 
values of said attribute across the records; 

each attribute value comprising a deviation from the characteristic 
value of said each attribute; 

each record comprising the set of attributes based on deviations of 
the attribute values, to provide a key; and 

wherein the set of records are clustered based on the key into a
clustering result that includes a plurality of clusters;

wherein the key comprises an ordered list of the set of attributes and 
the deviations; and 

means for refining the clustering result by selectively changing a
length of the key to change the number of the clusters.

22. (Original) The system of claim 21, wherein a mean value of the
attribute values of said each attribute is calculated as the characteristic 
value. 

23. (Original) The system of claim 21, wherein a median value of the
attribute values of said each attribute is calculated as the characteristic 
value. 

24. (Original) The system of claim 21, wherein the deviation is
calculated as a difference between said each attribute value and the 
characteristic value of said each attribute. 

25. (Original) The system of claim 21, wherein the deviation is
determined by calculating a difference between said each attribute value 
and the characteristic value of the corresponding attribute, and by dividing 
the difference by the characteristic value of said each attribute. 

26. (Original) The system of claim 21, wherein the set of attributes is
sorted using absolute values of the deviations of the attribute values as a 
sorting criterion. 